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(54) Method for encoding data In COFDM systems 



(57) In a coded orthogonal frequency division multi- 
plex (COFDM) system n-bit data words are encoded as 
2 m -symbol code words (binary, quaternary, octary. etc.). 
The code words are selected for desired low peak-to- 
mean envelope power ratio (PMEPR) characteristics of 
transmissions over a COFDM channel, from a set of 
cosets of a linear sub-code of a code having a specified 
generator matrix. The code words thus identified by the 
procedure described can, even for values of m in 
excess of 3, simultaneously limit the PMEPR to a known 
value, provide specified error control characteristics, be 
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implemented in a feasible manner using analytical cir- 
cuit techniques (e.g. with combinatorial logic), and 
include sufficiently many different code words to enable 
data to be transferred at useful rates. In particular, sets 
of cosets can be identified without calculating the maxi- 
mum peak envelope power over each coset Various 
selections of code words can be made, enabling a 
reduced error detection capability to be accepted in 
order to obtain a higher data transfer rate. 
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Description 

Technical Field 

5 [0001 ] This invention relates to methods and apparatus for encoding data, and to communications systems incorpo- 
rating such methods and apparatus; in particular, though not exclusively, the invention relates to such methods, appa- 
ratus and systems for transmitting data using multi-carrier frequency division multiplexing, such as coded orthogonal 
frequency division multiplexing (COFDM). 

10 Background Art 

[0002] The increasingly widespread use of powerful computer-based equipment and appliances is resulting in rapidly 
increasing requirements for high-speed, high-capacity communications, based on both wireline and wireless tech- 
niques. One way of accommodating this increase is to design communications systems for operation at higher and 

is higher symbol rates. However, a increase in symbol rate typically involves a corresponding decrease in inter-symbol 
spacing, and this in turn can lead to greater intersymbol interference (ISI) and consequential errors in symbols as 
received. In the case of wireline links ISI can arise, for example, from signal reflections at imperfectly matched inter- 
faces between different portions of the communications link. For wireless links a significant source of ISI is multi-path 
propagation involving signal reflections from objects (such as buildings) in the space traversed by the signals; mobile 

20 wireless communications can be especially susceptible to ISI from this source, because movement of a mobile unit 
using the link can cause continuous and complex variation in the multiple propagation paths extending to the mobile 
unit. 

[0003] A technique which is being actively studied and developed to increase communications system capacity and 
counteract ISI is multi-carrier (or muKHone) frequency division multiplex operation, such as COFDM operation. In a 

25 COFDM system multiple symbols are transmitted simultaneously, each symbol controlling the modulation (e.g. by 
phase-shift keying) of a respective one of multiple carrier signals having closely-spaced frequencies. By choosing the 
carrier-frequency spacing to have an appropriate relationship to the rate at which symbols are transmitted on each car- 
rier, it is possible to ensure that each individual carrier can be demodulated without interference from the carriers adja- 
cent to it (in which case the carriers are said to be orthogonal to one another). The set of symbols transmitted 

30 simultaneously on respective carriers during a symbol period (a code word) is chosen to encode a group of data sym- 
bols (a data word) and includes redundancy (i.e. the code word has more symbols than the data word); this redundancy 
is used upon reception for detection of errors in the symbols as received, and, if desired, for correction of those errors. 
[0004] COFDM systems counteract ISI in part because the transmission of multiple symbols in parallel enables a 
desired data rate to be attained with a lower symbol rate than would be necessary if a serial system were used to trans- 

35 mit individual symbols in succession. A lower symbol rate implies a longer duration for each symbol and thus a longer 
period between symbol transitions, which reduces the impact of ISI. The error detection and correction capabilities pro- 
vided by the inclusion of redundancy enhance the resistance to data corruption, by enabling some errors which occur 
in received symbols owing to signal fading and other causes to be detected and corrected. 

[0005] However, present COFDM systems encounter difficulties in obtaining the maximum potential benefit for a par- 
40 ticular transmitted signal power,, owing to limitations arising from the typical peak-to-mean envelope power ratio 
(PMEPR) of the complex-valued signal from which the actual transmitted signal is derived. Because this transmitted 
signal is effectively the sum of several signals at closely spaced frequencies and with respective phases which change 
at each symbol transition, the overall signal tends to exhibit transient pronounced peaks separated by substantially 
longer intervals of much lower though still varying amplitude. The transmitting equipment must be adjusted to reproduce 
45 the peaks without clipping or other distortion, so for much of the time, in the intervals between peaks, that equipment is 
operating at a signal level well below its maximum capability. As a result the geographical range of the equipment is sig- 
nificantly lower than would be expected for its nominal power rating, or alternatively a much more powerful equipment 
must be provided to cover the desired range and then operated inefficiently. 

[0006] It is known that this problem can in principle be alleviated by careful selection of the code words which encode 
so the possible data words (for example in •Block coding scheme for reduction of peak to mean envelope power ratio of 
murticarrier transmission schemes* by AE. Jones, T.A. Wilkinson and S.K. Barton. Electronics Letters, 8th December 
1994, vol.30, no.25, pp. 2098-2099). However, in selecting code words to reduce the PMEPR it is important also to 
ensure the code has good error correction properties. Furthermore, implementation of a high-throughput communica- 
tions system desirably involves the use of long code words and of rapid encoding and decoding procedures; the use of 
55 simple look-up tables, though readily implemented, does not lend itself to rapid operation using compact (and low- 
power, inexpensive) circuitry, especially during decoding, if the number of valid code words to be checked is large, as 
is typically the case with long code words (e.g. more than about 16 symbols). It is therefore preferable for encoding and 
decoding operations to be definable in terms of some analytical procedure which for example performs combinatorial 
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logic on the data word or code word symbols to convert one to the other, or at least restricts the size of any look-up table 
which is used. 

[0007] An additional difficulty is presented by the sheer quantity of possible code words from which a selection may 
be made to define a code for use in a typical system. As the number of symbols in a code word increases the number 

5 of possible code wads increases much faster, even for binary symbols; for symbols having just three (ternary) or four 
(quaternary) possible values, the number of possible codes available becomes so huge even for quite short code words 
that a simple exhaustive search through all possible code words is not feasible in a realistic time even just to minimize 
the PMEPR. Attempting to satisfy error correction and implementational requirements as well merely adds to the diffi- 
culty of such a search, especially as it will not even be known whether there actually exists a code word set which is of 

10 the desired size and which simultaneously satisfies preferred criteria relating to PMEPR, error detection or correction 
and ease of encoding and decoding. This difficulty is further exacerbated by the desirability of having a large number 
of code words available for use in the chosen code, in order to be able to encode a large number of different data words 
(i.e. data words containing relatively may symbols) ad thus attain an acceptably high data transfer rate. 
[0008] It is known that certain kinds of codes may have some desirable properties. Thus the paper 'Synthesis of 

75 Power Efficient Multitone Signals with Flat Amplitude Spectrum' by B.M. Popovic, IEEE Transactions on 
Communications, vol.39, no.7, July 1991. pp. 1031-1033 indicates that any binary or polyphase complementary 
sequence can be used to construct a multitone signal such that the crest factor of the real-valued signal is less than or 
equal to 6 dBV (which corresponds to the PMEPR of the complex-valued signal being less than or equal to 3 dB). Exam- 
ples of these sequences have been described, as in the case of binary Golay complementary sequence pairs fCom- 

20 plementary series' by M.J.E. Golay, IRE Transactions on Information Theory, voUT-7, Apr. 1961 , pp. 82-87). However, 
Popovic's paper is directed to situations such as mufti path fading measurements and multitone jamming, in which a sin- 
gle code word can be used continuously in generating the required signal. Accordingly Popovic gives no guidance on 
how multiple such sequences could be identified just to attain the 6 dBV limit on PMEPR in a practical code for convey- 
ing information; and no mention whatsoever is made of the other requirements mentioned above for a code suitable for 

25 practical implementation. Indeed Popovic states that "the construction of multitone signals having minimum peak-to- 
. peak amplitude is an old problem that still has no analytical solution". 

[0009] It should be noted that Golay complementary sequence pairs and codes defined using them are different from 
and should not be confused with so-called Golay codes; Golay complementary sequence pairs and Golay codes were 
first defined, independently of one another, by the same researcher and for that reason alone both are referred to by his 
30 name. 

[0010] Methods and apparatus for encoding and decoding are described in International patent application no. 
PCT/GB 97/02 464. in which a substantial number of different code words are identifiable in practice, despite the poten- 
tially huge search space of possible code words, and which provide a predictable low limit on PMEPR, have specified 
error detection and correction properties and are feasible for implementation in high-speed circuitry. Some of the tech- 
35 niques described therein, for ordering cosets of code words according to associated PMEPR. can involve substantial 
amounts of computation. 

[001 1 ] It is an object of this invention to enable identification of large numbers of cosets of code words which will yield 
a PMEPR not greater than a predetermined threshold, without the need for extensive computation of peak envelope 
power over each coset 

40 

Disclosure of Invention 

[0012] According to one aspect of this invention there is provided a method of encoding data words as 2 m -symbol 
code words where m ^ 3, each code word symbol having q different possible values where q is an even integer greater 
45 than 1 , for transmission using mufti-carrier frequency division multiplexing, such as coded orthogonal frequency division 
multiplexing (COFDM), wherein data words are encoded in accordance with their value and code words selected in 
complete cosets, for a peak-to-mean envelope power ratio of said transmission not greater than a predetermined 
threshold of 2?* 1 for any fixed integer p where 1 £ p z m-2, from a set of one or more cosets of a linear sub-code of a 
code having a generator matrix as follows: 

so 

((2.0+1)x2 m )/2 m ((2.1+1)x2 /7, )/2' 7, .. ((2.z+1)x2 m )/2 m .. ((2.(2 m -1)+1)x2 m /2 m 

((2.0+1 )x2 m ' 1 )/2 m - 1 ((2.1+1)x2' 7, - 1 )/2 m - 1 ..((2.z+1)x2 m * 1 )/2 /Iv1 ..((2.(2 /TI -1)+1)x2 / "* 1 )/2 m ' 1 

((2.0+1 )x2'/2' ((2.1+1)x2")/2' .. ((2.z+1)x2>72' .. «2.(2 m -1)+1)x2> r )/2> r 

((2.0+1)x2 1 )/2 1 {(2.U^x2^)/2 A .. ((2.z+1)x2 1 )/2 1 .. ((2.(2 m -1)+1)x2 1 )/2 1 
55 ((2.0+1)x2°y2° ((2.1+1)x2°)/2° ((2.z+1)x2°)/2 0 .. ((2.(2'"-1)+1)x2 0 )/2 0 

where z has integer values from 0 to 2 m -1 . y has integer values from m to 0, x indicates bitwise multiplication of factors 
expressed as (m + 1)-brt binary numbers, and the result of division is expressed as a single digit; 
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wherein all coset representatives of cosets in said set of cosets have the form 
( Q /2)(XrX 2 )0(q/2)(X 2 *X3)©...e(Q/2)(X m . p . 1 *X n) . p ; 

®v 1<1 (xrx m . p ^ 1 )e 1 ^(x 2 *x m .p +1 )©...ev 1>m .p(x 0 ,yx m . p . 1 ) 

5 ©V 21 *X m . p+2 )©V 2f 2(X2*X m .p +2 )e...©V 2im . p (X m . 

^p.i(xrx m )© p ^(X 2 *x m )©...®v Ptm . p (X m yx m ) 

®«1 ^(X m . p *i **m-p+2) 

©W1 ,3(X m . p ^1 *X m . p ^)ew 2t3 P( m . p+2 *X m . p> 3) 

©Wl ^P^m-p+l *X m . p ^)©W 2i 4PC m . p ^2*X^ p+ 4)©W 3 4 P< m . p+ 3*X m . p ^4) 

TO ©W! pfX,,,^ *X m) ©W 2tP (X m . fH . 2 *X m )e. ..©W^ lP CX/n>1 %n) 

or a form derived therefrom by a permutation of the subscripts of the terms to X,„, where X 1 to X m are row labels of 
the first m rows in the generator matrix, terms of the form v p tm .fJQt m . p *X m ) indicate derivation of a word by symbol-wise 
multiplication of rows such as X m . p and X m followed by symbol-wise multiplication modulo q of such word by a respec- 
15 tive coefficient such as v pm . p © indicates symbol-wise addition of words modulo q, and the coefficients v, y for 1 s / £ 
p and 1 £ j z m-p and the coefficients w$ for 1 £ i<j £ p have integer values chosen from the range 0 to q-1 . 

Brief Description of Drawings 

20 [0013] Methods and apparatus fa encoding data in accordance with this invention in communications systems for 
transmitting data using coded orthogonal frequency division multiplexing (COFDM) will now be described, by way of 
example, with reference to the accompanying drawings in which:- 

Figure 1 illustrates muttipath propagation between a base station and a mobile communications device; 

25 Figure 2 is a schematic diagram showing the effect of muttipath propagation on propagation of a single sym- 

bol; 

Figure 3 is a schematic diagram showing intersymbol interference arising from muttipath propagation; 

Figures 4a to 4/7 are schematic diagrams showing reduction in intersymbol interference by the use of multi-carrier 
frequency division multiplexing; 
30 Rgure 5 shows an encoder and modulator for use in a COFDM system; 

Figure 6 is a block schematic diagram of a COFDM system incorporating the encoder and modulator of Rg- 

ure 5; 

Rgure 7 illustrates the general form of a COFDM signal waveform without any limitation of PMEPR; and 

Rgure 8 shows an encoder for encoding 1 0-bit data words as 1 6-bit code words. 

35 

Pesj Mode for Carrying Out the Invention, Industrial Applicability 

[0014] Rgures 1 to 3 illustrate in a schematic manner the problem of muttipath propagation in the context of mobile 
communications in an urban environment. A base station 1 0 communicates by wireless, e.g. at microwave wavelengths. 

AO with mobile communications devices such as a mobile telephone 12. There is a direct line-of-sight signal path 14 
between the base station 10 and the telephone 12. and, owing to the presence of multiple reflectors (for microwave sig- 
nals) in the form of buildings and other structures, there are also multiple reflected signal paths 16. These paths are 
typically of differing lengths and thus involve differing propagation delays upon arrival at their destination. The overall 
signal as received at for example, the mobile telephone 12 is the sum of the various signals arriving at the telephone 

45 via the different pains. 

[0015] The transmitted signal may be considered as comprising a succession of symbols modulating a carrier wave 
(e.g. by phase-shift keying). As shown in Rgure 2, a consequence of signals traversing multiple paths with different 
propagation delays is that the composite signal 18 as received for a symbol is not an accurate replica of the signal 20 
as transmitted for that symbol. In particular the received composite signal is typically longer in duration than the trans- 

50 mitted signal, and has a trailing portion with a signif icant amplitude after the point when an accurate replica of the trans- 
mitted signal would have terminated. If this trailing portion extends for a significant fraction of a symbol period, as shown 
in Figure 3 for the pulse nominally terminating at time T. the trailing portion will overlap and interfere with reception of 
the following symbol. As a result that following symbol may be inconectly received, i.e. the value detected for it in the 
receiver may not correctly represent the value as transmitted, even in the absence of other noise sources. Signal cor- 

55 ruption of this kind is known as intersymbol interference (ISI). 

[0016] The effects of ISI can be alleviated by extending the duration of the transmitted symbols, as shown in Figure 
4a, so that each symbol as received extends beyond the trailing portion arising from multipath propagation delays. To 
counteract the reduction in symbol rate which would ensue, it has been proposed to transmit multiple symbols simulta- 
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neously. each modulating a respective one of multiple carrier waves. Thus if eight carriers; are used, as shown in Fig- 
ures 4a to 4h, the symbol period on each carrier can be eight times that which would be necessary with a single earner 
(as shown in Figure 3) while maintaining the same overall symbol rate. If the frequency spacing between the multiple 
carriers is an integer multiple of the inverse of the symbol period, it is possible to recover the individual symbol stream 
5 modulating each of the carriers without interference from the other carriers. This technique is known as orthogonal fre- 
quency division multiplexing (OFDM). 

[0017] In addition to the use of multiple carriers, it has also been proposed to provide additional error detection and 
correction capabilities by the use of block encoding. Thus, as shown in Figure 5. a serial stream of data symbols (binary 
symbols in this example) is nationally divided into successive groups or words each containing five symbols. A group of 

w five symbols is input to a five-bit to eight-bit (5B/8B) encoder 30 which outputs a corresponding eight-bit code word. This 
code word is applied to a bank of eight phase shift controllers 32 which control a bank of eight oscillators 34 producing 
carrier signals at eight successive frequencies at intervals Af apart Each individual bit in the code word causes a 
respective phase shift controller to impose either a positive or negative phase shift A<J> to modulate the carrier signal pro- 
duced by the respective oscillator, depending on the current value of that bit. 

is [0018] The modulated outputs of the oscillators 34 are combined in an adder 36 to produce the transmitted signal. In 
practice the functions of the oscillators 34 and the adder 36 are conveniently combined in a digital signal processor 
implementing an inverse fast Fourier transform (IFFT). as indicated by the dashed box 38. Thus, as shown schemati- 
cally in Figure 6, data to be transmitted are received in block 40 and converted to parallel form in a serial-to-parallel con- 
verter 42. The parallel data are encoded in encoder 30. as described above, and the encoded data control generation 

20 of the multi-carrier signal in the modulator/lFFT 32/38. This multi-carrier signal is converted to analogue form in a D-A 
converter 44, amplified and transmitted by a transmitter 46. 

[0019] Reception is essentially a reverse of this process: the signal is received and amplified by a receiver 48, and 
then converted to digital form by an A-D converter 50. The Digital signal is supplied to a demodulator 52 which com- 
prises a digital signal processor implementing the fast Fourier transform (FFT) to separate and demodulate the compo- 
25 nent signals in the composite multi-carrier signal. The demodulated data are passed to a decoder 54 which reverses 
the encoding applied by the encoder 30, and typically also performs error detection and correction functions. Thereafter 
a parallel-to-serial converter 56 changes the data into serial form to be output at block 58. 

[0020] The choice of code which is implemented by the encoder 30 has significant implications for the efficient oper- 
ation of the transmitter 46. A multi-carrier signal comprises the sum of several sinusoidal waves with equal amplitudes 

30 and equally spaced frequencies. As a result, the maximum absolute value of the composite signal is strongly sensitive 
to the relative phases of the component sinusoids. These relative phases are changed at the end of each symbol period 
by the phase shift controllers 32. according to the value of the code word to be transmitted during the next symbol 
period. Depending on the particular set of relative phases, and therefore on the value of the current code word, the 
amplitude of the transmitted signal will have occasional pronounced peaks together with intervals of much lower though 

35 still varying value, as illustrated in Figure 7. Thus the peak-to-mean envelope power ratio of the transmitted signal is 
relatively high. 

[0021 ] For such a signal to be transmitted without distortion, it is necessary for the transmitter 46 to use a linear ampli- 
. f ier and to operate it in a relatively inefficient manner, so that most of its dynamic range is unused except tor the infre- 
quent occasions when the amplitude of the transmitted signal reaches its peak value. 

40 [0022] It has been proposed to reduce this problem by appropriate choice of the set of code words which are actually 
used to encode data for transmission. The use of block coding to avoid the use of code words which produce extreme 
values of PMEPR has been suggested in ^Minimisation of the peak to mean envelope power ratio of multicarrier trans- 
mission schemes by block coding' by T.A. Wilkinson and A.E. Jones. 1995 IEEE 45th Vehicular Technology 
Conference, July 1995. pp. 825-829; although this proposal indicates that reductions in PMEPR are possible, it also 

45 identifies the difficulty of selecting a set of code words which both enables efficient implementation of coding and 
decoding and provides desired capabilities for detecting and correcting errors in the demodulated signal. Possible ways 
of addressing this problem are discussed, but no actual solution is presented. A modification to this approach is dis- 
closed in 'Combined coding for error control and increased robustness to system nonlinearities in OFDM' by the same 
authors in 1996 IEEE 46th Vehicular Technology Conference, April 1996, pp. 904-908. In this modification a linear block 

so code is chosen to provide desired properties of error correction and detection; it is proposed then to exploit the linearity 
and redundancy in the code to transform the code values systematically to new values which reduce the PMEPR but 
have equivalent error detection and correction properties. Examples are given for 4B/7B, 4B/8B and 1 1 B/15B codes, in 
which the required transformation is identified by a procedure involving listing all possible code word values in order of 
the PMEPR for a mufti -carrier signal modulated in accordance with each code word. Although such an exhaustive- 

55 search approach is feasible for relatively short code words (e.g. of the order of 15 bits), the number of possible code 
words to be checked increases very rapidly with code word length. Thus, even for a 32-b'rt code word, in excess of 
4.000,000.000 code words would potentially need to be listed, and this would be at best extremely time-consuming and 
in practice possibly prohibitive. The obstacle presented by this requirement is potentially increased by the absence of 
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any guarantee that an appropriate subset of code words actually exists within the set being examined. 
[0023] Popovic, in the paper cited earlier, proposes the use of a single binary or polyphase complementary sequence. 
However, the 1996 paper mentioned above suggests that such sequences are not readily amenable in relation to their 
error correction/detection capability. 
5 [0024] The patent application referenced above describes ways of identifying a set containing many different code 
words all of which satisfy desired criteria regarding error correction and detection properties, the maximum PMEPR of 
mufti-carrier signals modulated in accordance with those code words, and ease and efficiency of implementation of a 
practical encoder and decoder (for example including use of combinatorial logic). 

[0025] As described in that patent application, a set of code words is constructed by combining a base' code, derived 
10 from a 'generator matrix', with one or more 'coset representatives'. For example, in the case of 16-bit (binary) code 
words the generator matrix is: 

(0000 0000 1111 1111) (X,) 

(0000 1111 0000 1111) (Xg) 

75 (0011 0011 0011 0011) (X3) 

(0101 0101 0101 0101) (X4) 

(1111 1111 1111 1111) (X 1 ) 

The 1 6-bit base code comprises all thirty-two linear combinations of rows of this generator matrix, that is combinations 

20 of the form 

a 1 X 1 © a 2 X 2 e a 3 X 3 © 84X4 © aid 
where row coefficients a t , a 2 . a 3 , 34 and a' each take values of 0 or 1 . This base code is linear, i.e. the result of a bitwise 
exdusive-OR operation (modulo-2 addition) on any two of its code words is another code word in the coded 
[0026] A corresponding generator array is created by combining the 16-bit generator matrix with additional rows (in 

25 this case six) derived by bitwise multiplication of all possible pairs of the rows X, , X 2 , X 3 and X4: 

(0000 0000 0000 1111) (X! *X2) 
(0000 0000 001 1 001 1) (Xt *X3) 

(oooo 0000 0101 0101) (x^y^) 

30 (0000 001 1 0000 001 1) (X 2 *X<j) 

(0000 0101 0000 0101) (x 2 *y*4) 

(0001 0001 0001 0001) (X 3 %j) 

Twelve particular coset representatives (for example, 0001 0010 0001 1 101) constructed in a specific way from these 
35 additional rows are each combined by bitwise exclusive-OR operations with all code words in the 16-bit base code to 
produce the final desired set of code words, constituting a set of cosets of the base code. 

[0027] Those code words constitute a set of cosets of a linear sub-code (which may be the whole code) having a gen- 
erator matrix comprising the rows X 1( X^ X 3 , X4 and X\ Using a base code of this kind provides desired error detection 
and correction properties. Selection of a linear sub-code ensures that encoding and decoding within each coset can be 

40 implemented compactly, for example using combinatorial logic A coset can in general be considered to be a systematic 
shift or translation of the base code to a different part of the overall space of possible code words. Use of cosets of the 
linear sub-code enables avoidance of certain undesired code words (such as all zeroes and all ones) which would result 
in particularly high values of PMEPR. but can also enable the enor control properties of the base code to be retained 
to some extent Choosing the code words from a set of such cosets allows good control of PMEPR because each coset 

45 of the base code tends to contain code words having broadly similar associated PM EP Rs. In particular cases where the 
coset representatives have a form descrfced in the patent application referenced above, it can be shown that the 
PMEPR does not exceed 3 dB; the minimum Hamming distance (i.e. the minimum number of symbols whose value 
must be altered, for example by interference, to transform a valid code word containing 2 m binary symbols into some 
other valid code word) is 2 m ' 2 . 

so [0028] In more general terms the generator matrix for a particular code word length 2 m is: 

((2.0+1 )x2" , )/2 m ((2.1+1)x2 n> /2 / " .. ((2.2+1)x2 m /2 m .. ((2.(2 ,n -1)+1)x2 m )/2 / " 

((2.0+1)x2 m1 )/2 nv1 ((2.1+1)x2 / "- 1 )/2 /7, - 1 ((2.2+1)x2 m l )/2 m1 .. ((2.(2 m -1)+1)x2 /7M y2 m ' 1 

((2.0+1 )x2")/2 y ((2.1+1)x20/2* .. ((2.z+1)x2^ ((2.(2 m -1)+1)x20/2^ 

55 ((2.0+1)x2 1 )/2 1 ((2.1+1)x2 1 V2 1 .. ((2.z+1)x2')/2 1 ((2.(2 m -1)+1)x2 1 )/2 1 

((2.0+1 )x2°)/2 0 ((2.1+1)x2°)/2° .. ((2.2+1)x2°/2° .. ((2.(2 m -1)+1)x2 0 V2 0 

where z has integer values from 0 to 2 m -1, y has integer values from m to 0. and x indicates bitwise multiplication (bit- 
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wise logical AND) of two factors (such as (2.2+1) and both expressed as (m+1)-bit binary numbers, and the result 
of division is expressed as a single digit The first line of this matrix (y=m) gives the row X t , the second line (y=m-l) 
gives the row X 2 , the penultimate line (y=1) gives the row X m and the f inal line (y=0) gives the row X*. 
[0029] Thus, for example, for a code word length of sixty-four brts (m=6), the first symbol (2=0) on the second row X 2 
5 (y=/7?-1=5) is 

((2.0+1)x2 m - 1 V2 /7) : 1 
that is (1 x 32)/32 

Expressing the factors involved in the bitwise murtipOcation as (m+1)-bit binary numbers results in 
w (0000001 x 010000Q)/32 

which is (00O0O0OV32 or (expressed as a single digit) 0. 

The second symbol (2=1) on the second row is ((2.1+1) x 2 m * 1 )/ m ' 1 which is (3 x 32V32. which yields (0000011 x 
0100000)/32. or 0 again. The sixteenth symbol (2=15) on this row is ((2.15+1) x 2 m - 1 )/2 m1 which is (31 x 32)/32. which 
is also yields (001 1 1 1 1 x 0100000)/32. or 0. 

[0030] The seventeenth symbol (2=16) is ((2.16+1) x 2 m * 1 )/2 m - 1 which is (33 x 32)/32. This yields (0100001 x 
0100000)/32, which is (0100000)/32 or 1. The same result is obtained for the eighteenth through to the thirty-second 
symbols, 

[0031 ] For the thirty-third symbol (2=32) the matrix gives ((2.32+1) x 2 m ^)/2 m '^ which is (65 x 32)/32 and this yields 
: 20 (1000001 x 0100000)/32. that is 0. A result of 0 is likewise obtained for the following fifteen symbols. 

[0032] The forty-ninth symbol (2=48) is given by the expression ((2.48+1) x 2 m ' 1 )/2 m1 which is (97 x 32)/32 giving 
(1100001 x0100000)/32. or 1, which is the result for the final fifteen symbols as well. Thus the second row in its entirety 
is 

25 (0000 0000 0000 0000 1111 1111 1111 1111 0000 0000 0000 0000 1111 1111 1111 1111) 

[0033] For the third row X 3 (y=m-2=4) the first symbol is ((2.0+1) x2 m " 2 )/2 m - 2 that is (1 x 16)/1 6 which is (0000001 x 
0010000)/16, or 0; the same result is produced for the next seven symbols. For the ninth symbol the expression is 
((2.8+1) x 2 m - 2 )/2 / "* 2 that is (17 x 16)/16 and this yields (0010001 x 0010000)/16. or 1 . and similarly for the tenth to six- 
30 teenth symbols. Thereafter the symbols continue in alternating blocks of eight zeroes and eight ones for the rest of the 
row: 

(0000 00001111 1111 0000 0000 1111 1111 0000 0000 1111 1111 0000 0000 1111 1111) 

35 The other rows of the generator matrix for 64-bit code words would be derived in similar manner. 

[0034] The 2 m -bit base code comprises all linear combinations of rows of the generator matrix, that is combinations 
of the form 

a^ © a 2 X 2 e a 3 X 3 © .... © a m X m 0 aX 
where row coefficients a 1t a 2 , a 3 , .... a m and a* each take values of 0 or 1. The corresponding generator array is created 
40 by combining the 2 /n «bit generator matrix with additional rows derived by bitwise multiplication of all possible pairs ot 

the rows X 1f X 2 , X 3 X m . Particular coset representatives derived from these additional rows in a specific way are 

( w ; _ each combined by bitwise exclusive-OR operations with all code words in the 2 m -brt base code to produce the required 

code words, constituting a set of cosets of the base code. 

[0035] This technique is also applicable to coding with any higher number q of symbol values which is an integer mul- 
45 tiple of two. such as quaternary (q=4), hexary (q=6), octary (q=8) and higher orders. In such cases the same generator 
matrix is used as described above, e.g. for a code word containing 8 symbols (2 m =8 with m=3) : 

(0000 1111) (X0 
(0011 0011) 
so (0101 0101) (X 3 ) 

(11111111) (X*) 

[0036] An 8-symbol quaternary base code (four possible symbol values) comprises all 256 linear combinations of the 
rows of this generator matrix, that is combinations of the form 
55 a 1 X 1 © a 2 X 2 © a 3 X 3 © aX 

where © now indicates symbol-wise addition modulo q (i.e. modulo 4 for quaternary) and row coefficients a 1f a 2 , a 3 and 
a' each take all integer values from 0 to q-1 (i.e. 0 to 3 tor quaternary). This base code is linear, i.e. the result of symbol- 
wise addition modulo 4 on any two of its code words is another code word in the code. The corresponding generator 
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array is created by combining the generator matrix with additional rows. These additional ro.vs are derived cy symbol- 
wise multiplication modulo 4 of all possible pairs of the rows X A , X 2 and X 3 , that is X, *X 2 , X, *X 3 and X 2 *X 3 , Thus for 8- 
symbol quaternary coding the additional rows are: 

5 (0000 0011) (X^Xg) 

(0000 0101) (X^XJ 
(0001 0001) (X 2 *X3) 

[0037] Likewise, a 2™-symbol quaternary base code comprises all linear combinations of rows of the generator 

io matrix, that is combinations of the form 

a^ © a 2 X 2 © a 3 X 3 ©....© a m X m © aX 
where row coefficients a lt a 2 , a 3 .... a m and a' each take values of 0, 1, 2 or 3. The corresponding generator array is 
created by combining the 2 m -symbol generator matrix with additional rows derived by symbol-wise multiplication mod- 
ulo 4 of all possible pairs of the rows X, , X 2 , X 3 X m . Particular coset representatives derived from these additional 

is rows in a specific way are each combined by symbol-wise addition modulo 4 with all code words in the 2 m -symbol base 
code to produce the required code words. 

[0038] Where the number of symbol values is q. the base code comprises all linear combinations of rows of the gen- 
erator matrix, that is combinations of the form 

a l X 1 © a 2 X 2 © a 3 X 3 © .... © a m X m © aX 

20 where row coefficients a 1( a 2 , a 3 a m and a* each take values of 0, 1 q-1 and © indicates symbol-wise addition 

modulo q. This base code is linear, i.e. the result of a symbol-wise addition modulo q on any two of its code words is 
another code word in the code. The corresponding generator array is created by combining the 2™ -symbol generator 
matrix with additional rows derived by symbol-wise multiplication modulo q of all possible pairs of the rows X 1( X 2 , X 3 , 
.... X m . Particular coset representatives derived from these additional rows in a specific way are each combined by sym- 

25 bol-wise addition modulo q with all code words in the 2 m -symbol base code to produce the required code words. 

[0039] In discussing the error detecting and error correcting properties of quaternary, octary and higher-order codes, 
the minimum Hamming distance alone does not fully describe the extent of possible errors affecting a code word. This 
is because an error in a single symbol can involve a transition through up to qfZ adjacent symbol values (where the val- 
ues q-1 and 0 are considered to be adjacent). The practical effect may be as serious as, or more serious than, two or 

30 more errors in different symbols each involving only a single transition to an adjacent symbol value. For example, a 
change of a single octary symbol from value 6 to value 1 (a transition through 3 adjacent values) may be more serious 
than a change of two octary symbols, one from 6 to 7 and another from 7 to 0 (both being transitions to an adjacent 
value). Therefore an additional useful measure is the minimum Lee distance, defined as the minimum number of sym- 
bol value transitions required to transform a valid code word into some other valid code word. In the binary case the 

35 Hamming and Lee distances are identical. 

[0040] The methods and apparatus for encoding and decoding COFDM signals as described in the above-referenced 
patent application provide a highly advantageous combination of error detection and correction properties, low PMEPR, 
convenience of implementation and useful rate of data transfer. As discussed in that patent application, in certain cir- 
cumstances it may be desired to modify the selection of code words to alter the trade-offs between these different prop- 

40 erties. 

[0041 ] For example, it may be desired to increase the available population of code words, in order to increase the data 
transfer rate, at the expense of accepting a higher maximum value of PMEPR. For example, it is possible to construct 
additional cosets of the base code and to derive the maximum peak envelope power (PEP) resulting from use of code 
words within that coset. This enables the cosets to be listed in ascending order of maximum PEP over each coset, so 
45 that cosets can be selected to provide some desired number of code words without yielding a PMEPR in excess of 
some desired threshold. However, identifying possible trade-offs between data transfer rate and PMEPR by construct- 
ing a list of coset representatives ordered by maximum PEP over each coset as descrtoed in the above-referenced pat- 
ent application, has the disadvantage, especially for longer code words and/or higher-order codes, of requiring 
extensive calculation. 

so [0042] The present invention identifies a set of coset representatives which can be combined with the base code to 
produce code words associated with a PMEPR no greater than a known value, without calculating the maximum PEP 
over each coset (although this technique does not provide any ordering by PMEPR for the cosets in the set). Although 
for convenience the invention will be described below with reference to binary, quaternary and octary codes (q=2. 4 and 
8 respectively), the invention can be applied to any code where the number q of possible symbols is an even integer. 

55 [0043] The technique will be illustrated rust in the case of a binary code of length 16 (/n=4), that is with a generator 
array 

(0000 0000 1111 1111) (X,) 
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(0000 1111 0000 1111) (X 2 ) 

(0011 0011 0011 0011) (X 3 ) 

(0101 0101 0101 0101) (X4) 

(1111 1111 1111 1111) PC) 

5 (0000 0000 0000 1111) (xrxa) 

(0000 0000 001 1 001 1) (xrX3) 

(0000 0000 0101 0101) PVX4) 

(0000 001 1 0000 001 1) (X 2 *X 3 ) 

(0000 0101 0000 0101) PVX4) 

10 (0001 0001 0001 0001) PVX4) 

As a first step towards producing the set of coset representatives, an initial set of eight (2 m " 1 ) expressions is con- 
structed, each having the form 

is where v, , v 2 and v 3 are coefficients having values of 0 or 1 and © indicates bitwise addition modulo 2. Thus the initial 
set is 

(X^X^epVXa) (v^O.v^.v^O) 
(x 1 *x 2 )e(x 2 *X3)e(x 1 *x 4 ) (v 1= i .v 2 =o,v 3 =o) 

20 (X^X^epC^X^epCg-XJ (v^O.Vgol.Va-O) 

(X! VXaW V^Wfe*^ (v 1= 1 ,v 2 »1 ,v 3 =0) 
(X^X^eCXz^WXa-X^ (v^O.v^.vaol) 
(X 1 *X 2 )©(X 2 *X 3 )®(X 1 *X4)©(X 3 *X4) (V!=1 .v 2 =0,v 3 =1) 

PVX^e^x^etx^x^epvx^ (vt=o.v 2 »i ,v 3 =i) 
25 (X^x^epc^XajetXi^je^^ecXa^xj (v 1= i ,v 2 =i t v 3 =i ) 

This initial set is then extended by applying all possible permutations of the subscripts of the row identifiers X t , X 2 . X 3 
and X4 in each expression. Thus, for example, applying the permutation % defined by n(1)=2, *(2)=3, *(3)=4, n(4)=1 to 
the subscripts of the row identifers in the expression (X^X^^X^) results in the expression (X 2 *X 3 )©(X 3 *X4). 

so [0044] A final set is produced by identifying the distinct expressions within the extended set; for this purpose, terms 
with the same pair of subscripts but in reverse order (such as X/X 2 and X 2 *X,) are regarded as being the same, and 
the ordering of such terms within an expression is disregarded. For example, applying the permutation *' defined by 
n'(1)=4. n'(2)=3. k\Z)=2, k'(4)=1 to the same expression (X/X^epc^Xa) results in the expression (XVXjepVXJ, 
which is not regarded as being distinct from (X^X^epvX*). 

35 [0045] The final set of expressions, forty-nine in all, defining the required set of coset representatives, is given in Table 
1 . in terms of six coefficients which constitute multipliers for the respective additional rows of the code's generator array 
which are added modulo 2 to create the corresponding coset representative; thus, for example, a 1 under the heading 
•1 2' indicates that the additional row (X, 'X^ is multiplied by 1 for inclusion in the coset representative, whereas a 0 indi- 
cates it is multiplied by 0 (i.e. effectively omitted). 

40 
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0 




1 


0 


1 


0 


0 


0 
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Table 1 (continued) 
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[0046] A code whose code words are obtained by combining any of these coset representatives with the base code 
has the following properties: 

20 

the code's minimum Hamming distance is 4 (2 m_2 ); 

a multi-carrier signal modulated in accordance with these code words has a PMEPR of no more than 6 dB; 
a decoder based on Hamming distance for use with such a code can operate using analytical techniques and can 
be implemented, for example, using combinatorial logic at least in part; 
25 - there are sufficient code words available to enable an acceptable data transfer rate to be attained. 

[0047] Figure 8 shows a 106/16B encoder 100 for encoding 10-bit data words as 16-bit code words selected in 
accordance with this inventioa This encoder includes five binary multipliers 102 to 1 10 for multiplying the five least sig- 
nificant bits of the input data word (treated as row coefficients a, . a 2 , a 3 . a* and a 1 ) by respective ones of the 1 6-bit gen- 

30 erator matrix rows X* X 3 , X 2 , Xt and X\ The five most significant bits of the input data word are supplied to a selector 
112 which selects one of thirty-two possible coset representative values in accordance with the combined value of 
those five bits. In the example shown in Figure 8 the first thirty-two of the forty-nine possible coset representatives listed 
in Table 1 have been chosen (arbitrarily) for use in the selector 112. Thus, for example, the f irst coset representative 
comprises the modulo-2 addition of the adcfitional rows (X^Xa) and (X 2 *X 3 ). that is (0000 0000 0000 1111)0(0000 

35 0011 0000 0011)=(0000 0011 0000 1100), and is arbitrarily assigned to the 5-bit selector input value 00000; the last 
coset representative in the selector 112, arbitrarily assigned to the input value 11111. comprises the modulo-2 addition 
of the additional rows (XrXa), (X/X^, PVX4) and (X3OQ). that is (0000 0000 0011 0011)0(0000 0000 0101 
0101)0(0000 0101 0000 0101)0(0001 0001 0001 0001)=(0001 0100 0111 0010). 

[0048] The outputs of the five binary multipliers 102 to 1 10 and of the selector 1 12 are combined by a bitwise exdu- 

40 sive-OR circuit 1 1 4 to produce the 16-bH code word. 

[0049] This code word can be decoded after reception in various ways. For example, K can be treated as a code word 
in the second order Reed-Muller code of length 16. RM(2,4). and decoded using known techniques for such a code 
word. As an alternative the "Supercede" decoding method described in 'Soft decoding techniques for codes and lat- 
tices, including the Golay code and the Leech lattice' by J.H. Conway and N.J.A. Sloane, IEEE Trans, inform. Theory, 

45 1986, vol.lT-32, no.1, pp. 41-50, may be used. Other decoders are described in the above-referenced patent applica- 
tion. 

[0050] In the general case of a binary code of length 2 m bits where m £ 3, the initial set of 2 m ' 1 expressions is con- 
structed with the form 

(X/Xa)©^^)©...©^^-^ 
50 where v 1 , v 2 . ... v m .^ are coefficients having values of 0 or 1 , and the permutation of subscripts and identification of dis- 
tinct expressions is performed as descrtoed above, to define a set of coset representatives which may be used in 
encoding (e.g. by incorporating some or all of them in the selector of an encoder similar to that shown in Figure 8). The 
number of cosets and the total number of code words identified by this procedure for various code word lengths (in 
terms of the parameter m) are given below: 
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m 


Co sets 


No. of code words 


3 


7 


112 i 


4 


49 


1568 


5 


552 


35328 


6 


7800 


998400 


7 


126360 


32348160 



[0051] A modification of this technique, applicable for word lengths of 32 or more bits (m £ 5), enables a large subset 
of these cosets to be identified, without the need for examining the extended set of expressions to find and discard 
expressions which are not distinct in other words, this modified technique can be guaranteed to produce an extended 
set of expressions which are inherently distinct To this end: 

- the combinations of values of the coefficients v, . v 2 , ... v m . 1t are limited to combinations such thatthe binary words 
of the form v 1 v 2 v 3 ...v m . 2 v m .i have Hamming weight of at least 4; and 

- the permutations of subscripts of the terms Xj to X m satisfy the condition that the subscript 1 is permuted to a value 
less than the value to which the subscript m-1 is permuted (i.e. jc(1)<7t(m-1)). 

Constructing an initial set of expressions with the form given above but with this limited set of combinations of coeffi- 
cients, and applying this limited set of permutations of the subscripts of the row identifiers in each expression, directly 
yields a final set of distinct expressions defining the required subset of coset representatives. The code words produced 
by combining the base code with any coset representative in this subset result in a PMEPR of at most 6 dB. Exactly 

coset representatives are identified by this procedure; in particular, at least ml.2 m - 3 coset representatives are identified 
by this procedure when m £ 8. 

[0052] An alternative modification, applicable for word lengths of 64 or more bits (m * 6), enables a smaller subset of 
cosets to be identified, again without the need for examining the extended set of expressions to find and! discard expres- 
sions which are not distinct and also without any restriction on the permutations *. To this end, the combinations of val- 
ues of the coefficients v 1 . v 2 , ... v m . t are limited to combinations such that the binary words of the form v 1 v 2 v 3 ...v /n .2 v /n. 
n have the following two properties: 

each word v 1 v 2 v 3 ...v m . 2 v m . 1 has Hamming weight of at least 4; and 

l(o-3)/2l l(m-a>/2] 

£ v M 2' < £ V W 2* 

i=0 1*0 



Constructing an initial set of expressions with the form given above but with this limited set of combinations of coeffi- 
cients, and applying all possible permutations of the subscripts of the row identifiers in each expression^, directly yields 
a final set of distinct expressions defining the required subset of coset representatives. The code words produced by 
combining the base code with any coset representative in this subset result in a PMEPR of at most 6 da At least mL2 m " 
4 coset representatives are identified by this procedure when m z 7. 

[0053] Similar techniques can be used tor identifying cosets of quaternary and higher-order code words which result 
in a known maximum PMEPR. For a quaternary code of length 2 m where m z 3, an initial set of 2 m " 1 expressions is 
constructed of the form 
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2(X 1 'X 2 )©2(X 2 -X 3 )©...©2(X m VXm-0 
where v 1f v 2 , ... v m . A , are coefficients having values of 0 or 1 and © indicates symbol-wise addition modulo 4. This initial 
set is then extended by applying all possible permutations of the subscripts of the row identifiers X 1t X 2 , X 3 , ... X m in 
each expression. A final set of expressions, defining the required set of coset representatives, is produced by identifying 
the distinct expressions within the extended set. as previously described for the binary case. Members of this set of 
coset representatives may be used in a selector of an encoder like that shown in Figure 8. but modified for operation 
with quaternary values. Thus, the coset representatives comprise symbols having possible values of 0 or 2; each gen- 
erator matrix row X 1f X 2 , X 3 , ... X m and X is multiplied by a quaternary value (representing, for example, the value of a 
respective pair of bits of an input binary word); and the products supplied by the multipliers are combined with the 
selected coset representative by symbol-wise modulo-4 addition. 

[0054] A quaternary code whose code words are obtained by combining any of these coset representatives with the 
quaternary base code has the following properties: 

- the code's minimum Hamming distance is 2 m ' 2 and its minimum Lee distance is 2 1 "' 1 ; 

a multi-carrier signal modulated in accordance with these code words has a PMEPR of no more than 6 dB; 

a decoder for use with such a code can operate using analytical techniques and can be implemented, for example. 

using combinatorial logic at least in part; 

- there are sufficient code words available to enable an acceptable data transfer rate to be attained. 

[0055] The number of cosets and the total number of quaternary code words identified by this procedure for various 
code word lengths (in terms of the parameter m) are given below: 



m 


Cosets 


No. of code words 


3 


7 


1792 


4 


49 


50176 


5 


552 


2260992 


6 


7800 


127795200 


7 


126360 


8281128960 



[0056] As in the binary case, for word lengths of 32 or more symbols (m z 5) a large subset of these cosets can be 
identified, without the need tor examining the extended set of expressions to find and discard expressions which are not 
distinct. This is accomplished by limiting the combinations of values of the coefficients v 1t v 2 . ... v m .\ and the permuta- 
tions of subscripts in the same way as described above for the binary case. Exactly 

coset representatives are identified by this procedure; in particular, at least ml.2 m " 3 coset representatives are identified 
by this procedure when m s 8. The code words produced by combining the quaternary base code with any coset rep- 
resentative in this subset yield a PMEPR of at most 6 dB. 

[0057] For word lengths of 64 a more symbols (m £ 6) a smaller subset of cosets can be identif ied, again without the 
need for examining the extended set of expressions to find and discard expressions which are not distinct, and also 
without any restriction on the permutations jl This is accomplished by limiting the combinations of values of the coeff i- 
cients v v v 2 . ... v m _ 1 to combinations such that the binary words of the form v 1 v 2 v 3 ...v m .2 v m-i nave ^e two properties 
specified in the discussion above tor the alternative modification in the binary case. For m £ 7 at least ml.2 m * 4 distinct 
coset representatives are identified by this procedure, and the code words produced by combining the quaternary base 
code with any coset representative in this subset yield a PMEPR of at most 6 dB. 

[0058] For quaternary code words of length 2 m where m 2 3, a further modification is possible, by allowing the coef- 
ficients v 1( v 2 , ... v m>1 to have any of the four values 0, 1, 2 or 3 and modifying the form of the expressions in the initial 
set to be 

2(VX 2 )©2(X 2 *X 3 )e...©2(X^ 2 -X m ^^ 
(i.e. without a multiplier of 2 before each coefficient). This initial set is then extended by applying all possible permuta- 
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tions of the subscripts of the row identifier X n . X 2 , X 3 , ... X m in each expression. A final set of expressions, defining the 
required set of coset representatives, is produced by identifying the distinct expressions within the extended set, as pre- 
viously described for the binary case. The number of cosets and the total number of quaternary code words obtained 
with this modification for various code word lengths (in terms of the parameter m) are given below: 



m 


Cosets 


No.of code words 


3 


37 


9472 


4 


661 


676864 


5 


14472 


59277312 


6 


360600 


5908070400 


7 


* 1290240 


* 84557168640 



It can be seen that many more cosets are available in this case, so there is a potential for a higher data transfer rate. 
The code words produced by combining the quaternary base code with any coset representative identified by this mod- 
ified procedure yield a PMEPR of at most 6 dB; however, the minimum Lee distance with this modification is reduced 
to2 m * 1 . 

[0059] An additional modification produces an extended set of expressions which are guaranteed to be distinct after 
permuting the subscripts of the row identifiers, by: 

- restricting the values of the coefficients , v 2 , ... v m>1 such that at least two of them have values other than 0 and 
2; and 

restricting the permutations of subscripts of the terms X 1 to X m to satisfy the condition that the subscript 1 is per- 
muted to a value less than the value to which the subscript m-1 is permuted (i.e. *(1)<7i(m-1)). 

For m £ 3, at least mltf*"' 5 distinct coset representatives are obtained with this approach. 

[0060] An alternative modif ication produces a smaller extended set of expressions which are guaranteed to be distinct 
after permuting the subscripts of the row identifiers without any restriction on the permutations tr, by fixing the values of 
the coefficients and v m0 to be 1 and 3 respectively while allowing the other coefficients to have any value from 0 to 
3 . For m 2> 3, at least ml.2 2m_€ distinct coset representatives are obtained with this approach. 
[0061] For an octary code of length 2 m where m £ 3, an initial set of 4 m l expressions is constructed of the form 

4(XrX2)©WX3)©...e4<X m .2%rM)^ 
where . v 2 . .. . v,^ are coefficients having values of 0, 1 , 2 or 3 and © indicates symbol-wise addition modulo 8. This 
initial set is then extended by applying all posstole permutations of the subscripts of the row identifiers X t , X 2 . X 3 , ... X m 
in each expression. A final set of expressions, defining the required set of coset representatives is produced by identi- 
fying the distinct expressions within the extended set as previously described for the binary case. Members of this set 
of coset representatives may be used in a selector of an encoder like that shown in Figure 8, but modified for operation 
with octary values. Thus, the coset representatives comprise symbols having possible values of 0, 2, 4 or 6; each gen- 
erator matrix row X 1t X 2 , X 3 ,... X m and X* is multiplied by an octary value (representing, for example, the value of a 
respective triplet of bits of an input binary word); and the products supplied by the multipliers are combined with the 
selected coset representative by symbol-wise modulo-8 addition. 

[0062] Octary code words obtained by combining any of these coset representatives with the base code have the fol- 
lowing properties: 

- the code's minimum Hamming distance is 2 m " 2 and its minimum Lee distance is 2 m ' 1 ; 

a multi-carrier signal modulated in accordance with these code words has a PMEPR of no more than 6 dB: 

- a decoder for use with such a code can operate using analytical techniques and can be implemented, for example, 
using combinatorial logic at least in part: 

- there are sufficient code words available to enable an acceptable data transfer rate to be attained; in general, this 
procedure identifies at least ml. 2m " 5 distinct coset representatives when m z 3. 

[0063] The number of cosets and the total number of octary code words for various code word lengths (in terms of 
the parameter m) are given below: 



13 



02/22/2000 09:22:22 page -13- 



EP 0 902 574 A2 



m 


Cosets 


No of code words 


3 


37 


151552 


4 


661 


21659648 


5 


14472 


3793747968 


6 


360600 


756233011200 


7 


2> 1290240 


>2.16x10 13 



[0064] A modification produces an extended set of expressions which are guaranteed to be distinct after permuting 
the subscripts of the row identifiers, by: 

- restricting the values of the coefficients v 1 , v 2 , ... v m .<, such that at least two of them have values other than 0 and 
2; and 

- restricting the permutations of subscripts of the terms X 1t to X m to satisfy the condition that the subscript 1 is per- 
muted to a value less than the value to which the subscript m-1 is permuted (i.e. ir(1)<jt(m-1)). 

For m £ 3, at least m!.2 2m * 5 distinct coset representatives are obtained with this approach. 

[0065] An alternative modification produces a smaller extended set of expressions which are guaranteed to be distinct 
after permuting the subscripts of the row identifiers without any restriction on the permutations n, by fixing the values of 
the coefficients v 1 and v m .<, to be 1 and 3 respectively while allowing the other coefficients to have any value from 0 to 
3. For m £ 3, at least ml.2 2/ "" 6 distinct coset representatives are obtained with this approach. 

[0066] For octary code words of length 2 m where m * 3, another modification is possible, by allowing the coefficients 
v 1( v 2 . ... v m ! to have any of the eight values 0 to 7 and modifying the form of the expressions in the initial set to be 

' 4(X n ; X2)©4<X 2 *X3)©...©4^^ 
(i.e. without a multiplier of 2 before each coefficient). This initial set is then extended by applying all possible permuta- 
tions of the subscripts of the row identifiers X, , X 2 , X 3 , ... X m in each expression. A final set of expressions, defining the 
required set of coset representatives, is produced by identifying the distinct expressions within the extended set, as pre- 
viously described for the binary case. The number of cosets and the total number of octary code words obtained with 
this modification for various code word lengths On terms of the parameter m) are given below: 




r m 


Cosets 


Naof code words 


3 


169 


692224 


4 


5917 


193888256 


5 


243912 


63940067328 


6 


>5.52x10 6 


>1.15x10 13 


7 


>3.09x10 8 


>5.19x10 15 



It can be seen that many more cosets are available in this case, so there is a potential for a higher data transfer rate. 
The code words produced by combining the octary base code with any coset representative identified by this modified 
procedure yield a PMEPR of at most 6 dB; however, the minimum Lee distance with this modification isreduced to 2 m " 2 . 
[0067] An additional modif ication produces an extended set of expressions which are guaranteed to be distinct after 
permuting the subscripts of the row identifiers, by: 

- restricting the values of the coefficients v lt v 2 , ... v m ^ such that at least two of them have values other than 0 and 
4; and 

- restricting the permutations of subscripts of the terms X, to X m to satisfy the condition that the subscript 1 is per- 
muted to a value less than the value to which the subscript m-1 is permuted (i.e. Ti(1)or(rr?-1)). 
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For m 7> 3, at least 18.ml.8 m * 3 distinct coset representatives are obtained with this approach. 
[0068] An alternative modification produces an extended set of expressions which are guaranteed to be distinct after 
permuting the subscripts of the row identifiers, without any restriction on the permutations n. by choosing the values of 
the coefficients v 1 and v^,^ to be any one of the following ordered pairs of values, where the value of is given first: 
5 1 ,2; 1 .3; 1 ,5; 1 ,6; 1 .7; 2,3; 2,5; 2,6; 2,7; 3.5; 3,6; 3,7; 5,6; 5,7; 6,7. The other coefficients may have any value from 0 to 
7. For m £ 3. at least 1S.fn\.8 m ' 2 distinct coset representatives are obtained with this approach. 
[0069] These techniques tor identifying cosets defining code words producing a PMEPR of no more than 6 dB can be 
extended to higher-order codes where q is an even integer. If the expressions in the initial set have the form 
(Q/2)(X 1 *X 2 )e(Q/2)(X 2 *X3)©...©(Q/2)(X m . 2 *X m . l ) 

10 

^ 1 {xrx m )ev 2 (x 2 *x m )©...©v m . 1 (x m . 1 *x /7) ) 

where v 1 , v 2 , ... v^ are coefficients having values of 0 to g-1 and © indicates symbol-wise addition modulo q, the 
code's minimum Lee distance is 2 m ' 2 . and arty even value of q is usable. If the expressions have the form 
is (Q/2)(X 1 *X 2 )®(Q/2)(X 2 *X3)e...®(Q/2)(X m . 2 *X m . 1 ) 

©2v 1 (XrX ni )©2V2(X 2 *X m )e...e2v m . 1 (X m .rX m ) 

where v 1t v 2 , . . . w m .^ are coefficients having values of 0 to (g/2)-1 , the code's minimum Lee distance is increased to 
20 2 m1 , but only values of q divisfoie by four are usable. 

[0070] More generally, large sets of coset representatives can be identified for codes containing 2 m symbols where 
m £ 3, each symbol having q possible values, for which the PMEPR is guaranteed to be at most 2 P * 1 . ( for any fixed 
integer p where 1 <; p £ m-2. To this end. p sets of coefficients are defined as follows 

25 (vi,i,v 1|2 . ...v 1in .p) 

(V 2 ,1.V 2 , 2 . --V 2>m .p) 

(v p ,i.v P(2 . ...v p m ^ 

where each coefficient can have an integer value from 0 to q-1. A further p(p-1 )/2 coefficients w,y, where 1 £ i<J £ p. 
30 are defined where each w^ can have an integer value from 0 to For an initial choice of the coefficients v^ and w^ 
an expression is constructed of the form 

<Q/2)(X 1 *X 2 )®(Q/2)(X 2 *X3)e...®(Q/2)(X m . p . 1 -X m . p ) 

35 ey/ 2(1 (x 1 *x m . p+2 )ev 2 ^p( 2 *x m .p^©...ev 2tm .ppc n ^ ) *x m . p ^ 2 ) 

©v P(1 (X 1 -X m )©v P(2 (X 2 *X m )©...© Ptm . p (X m . p -X m ) 
® w 1 ,2(X m .p + i *X m . p+2 ) 

.aCXm-p+i # X m . p+3 )©w 2f 3(X m . p ^ 2 *X /n . p+3 ) 

©*1 >4 (X m . p+1 # X fn . p ^ 4 )©W 2( 4(X m . p+1 *X m . p ^4)©W 3 , 4 {Xm-p+3*Xm-p+4) 

40 ©w 1 ^ m . p+1 -X OT )©w 2 ^ m . p ^ 2 *X m )©...©w p . 1 ^(X m . 1 *X m ) 

where © indicates symbol-wise addition modulo q\ further expressions of this form are constructed for the remaining 
possible combinations of choices of the coefficients v ? and w^, to obtain an initial set of QP (m " p ^ pCp * 1)/2 expressions. 
This initial set is then extended by applying all possible permutations of the subscripts of the row identifiers X 1( X 2 . X 3 , 

45 ... X m in each expression. A final set of expressions, defining the required set of coset representatives, is produced by 
identifying the distinct expressions within the extended set, as previously described for the binary case. The set of code 
words produced by combining the base code of order q with any coset representative identified by this procedure have 
a minimum Lee distance and a minimum Hamming distance both of 2 /n " 2 . In the case where q £ 4 and q is divisible by 
4, a further modification is possible in which the coefficients v^ and w, are restricted to have even values in the ranges 

so specified above. In this case fewer cosets will be identified, but the resulting code will have a minimum Lee distance of 

[0071] Another modification, tor p z m-4, produces an extended set of expressions which are guaranteed to be dis- 
tinct after permuting the subscripts of the row identifiers, by: 

55 - restricting the values of the coefficients v^- such that 
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and such that each word v/ iV/ 2 -. v/,/n-p-i v /,/n-p nas Hamming weight of at least 4; 
restricting the coefficients w,y for 1 £ /</ £ p to have values c72; 

restricting the permutations of subscripts of the terms X-j to X m to satisfy the condition that the subscript 1 is per- 
muted to a value less than the value to which the subscript m-p is permuted (i.e. 7c(1)<n(m-p)). 

If q £ 4 and q is divisible by 4, the coefficients v/y and v/g may additionally be restricted to have even values in the ranges 
specified above. In this case fewer cosets will be identified, but the resulting code will have a minimum Lee distance of 



[0072] The discussions above of quaternary codes have been expressed in terms of quaternary symbols having pos- 
sible values of 0, 1 . 2 and 3. For actual transmission, for example by quadrature phase-shift keying (QPSK) modulation, 
these values would correspond to four different values of phase shift One possibility is phase shifts of 0°, 90°, 1 80° and 
270° (equivalent to complex vectors 1, /, -1 and -/ respectively, where f*W-1). However, other variants are possible in 
which the phase shifts are. for example, 45°, 135°,225° and 315° (equivalent to complex vectors (1+/)/V2, (-1 +/)/>&. (- 
1 -/)/^2 and (1 -i)H2 respectively). This principle applies for codes with more symbol values (o>4, any even integer), and 
indeed in other applications the symbol values may be represented by any q distinguishable signal modulations; their 
amplitudes need not be equal, and, in the case of phase-shift keying, the phase differences between adjacent pairs 
need not all be equal. 

[0073] The code words in the codes defined above also have useful properties for establishing synchronization of tim- 
ing of circuit operation with a received signal. Accordingly, the circuitry for decoding these codes as described above 
can also be used to identify synchronization sequences in the received data, thereby providing economies in circuit 
implementation, ft is also envisaged that these codes have advantageous properties for communication by techniques 
other than COFDM; thus, for example, they may be used as codes for code division multiple access (CDMA) techniques 
or for modem transmission of digital data over analogue communication paths. 



1 . A method of encoding data words as 2 m -symbol code words where m £ 3, each code word symbol having q differ- 
ent possible values where q is an even integer greater than 1, for transmission using multi-carrier frequency divi- 
sion multiplexing, such as coded orthogonal frequency division multiplexing (COFDM), wherein data words are 
encoded in accordance with their value and code words selected in complete cosets, for a peak-to-mean envelope 
power ratio of said transmission not greater than a predetermined threshold of 2?* 1 for any fixed integer p where 1 
£ p £ m-2, from a set of one or more cosets of a linear sub-code of a code having a generator matrix as follows: 

((2.0+1 )x2 m )/2 m ((2.1+1)x2 m )y2 /n ((2.z+1)x2 m )y2' 7, .. ((2.(2 m -1)+1)x2 /7, )/2' n 

((2.0+1)x2 m - 1 y2 m - 1 {(2.U1)x2 m - 1 )/2 m - 1 «2.z+1)x2 m ' 1 )/2 m * 1 .. ((2.(2 m -1)+1)x2'"- 1 )/2 /n - 1 

((2.0+1 )x2J0/2 y ((2 J+1)x20y2^ .. ((2.z+1)x2")/2 x - {{2.(2 m -W)y&)t2 y 

((2.0+1)x2 1 )y2 1 ((2.1+1)x2 1 )/2 1 .. ((2.z+1)x2 1 )/2 1 .. ((2.(2'"-1)+1)x2 1 )/2 1 
((2.0+1 )x2°)/2 0 ((2.1+1)x2°)y2° .. ((2.z+1)x2°)/2 0 .. «2.(2' 7, -1)+1)x2 ( V2 0 

where z has integer values from 0 to 2 m -1, y has integer values from m to 0, x indicates bitwise multiplication of 
factors expressed as (m+1)-bH binary numbers, and the result of division is expressed as a single digit; 

wherein all coset representatives of cosets in said set of cosets have the form 

(qy2)(X l *X 2 )e(q/2)(X 2 *X3)©...©(Qy2)(X in . p . 1 *X m . p ) 

®Vl l l(Xi*X m .p 4>1 )®V 1 ^(X2*X m . p>1 )®- ,ro-p(X/n-p*Xro-p*l) 
® v 2.l( X r X m-p+2)® v 2,2( X 2*X m .p + 2)® —& / 2,m*p&m-p*Xm>p+2) 

©^p,i(xrx /71 )e Vp(2 (x 2 *x m )©...©v pm . p (x m yx n ,) 

®*h ^(Xnrvp+I *X m .p+2) 

®W 1 t 3(X m . p+1 *X m . p+ 3)©W2 ( 3(X m _p+2%7>-p+3) 

f 4P^m-p+r X m-p^)© w 2,4( X m-p+2* X m-p+4)® w 3,4P^fn-p-3* X /n-p+4) 

^(X/n-p+l *X^)©W 2t p(X m . p+ 2*X m )©...©Wp. 1 ^ I ( X m-r X m) 

or a form derived therefrom by a permutation of the subscripts of the terms X, to X m , where X, to X m are row labels 
of the first m rows in the generator matrix, terms of the form v p>m . p (X m .p*X n? ) indicate derivation of a word by sym- 



Claims 
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bol-wise multiplication of rows such as X m . p and X„, followed by symbol-wise multiplication moduto q of sue! • word 
by a respective coefficient such as v Pt/n _p © indicates symbol-wise addition of words modulo q. and the coefficients 
Vy for 1 s / s" p and 1 £ / £ m-p and the coefficients w,y tor 1 £ i<j s p have integer values chosen from the range 0 
toq-1. 

2. The method of claim 1 , wherein m is at least 4. 

3. The method of claim 1 or claim 2, wherein q is divisible by 4 and the coefficients v,y for 1 £ / s p and 1 £ y £ m-p 
and the coefficients w^ tor 1 £ k/ € p are chosen to be even. 

4. The method of any one of the preceding claims, wherein p=1 and all coset representatives of cosets in said set of 
cosets have the form 

(o72)(X/X 2 )(^gy2)(X 2 0(3)e...<^ 
where v 1 =v 1v V2=v 12 v m . lC v 1m . v 

5. The method of claim 4. wherein q=2. 

6. The method of claim 3, wherein p=1 and all coset representatives of cosets in said set of cosets have the form 

(q/2)(X 1 -X 2 )©(g/2)(X 2 *X3)©...e(Q/2)(X m . 2 *X m . 1 ) 

©2v l (X 1 *X m )©2v 2 (X 2 *X m )©...©2v m . 1 {X m . 1 *X m ) 
where the coefficients v 1 *vi,i/2, v 2 =v 12 /2 v^-Vi^A have values in the range 0 to (q/2)-1. 

7. The method of claim 6, wherein g=4. 

8. The method of claim 5 or claim 7. wherein mz5 and: 

the combinations of values of coefficients v 1( v 2 , ... v m>1 are limited to combinations such that binary words of 
the form v 1 v 2 ...v m4 v m . 1 have the property that each word v 1 v 2 ...v m . 2 v m . 1 has Hamming weight of at feast 4; 
and 

said permutations of subscripts of the terms X 1 to X m satisfy the condition that the subscript 1 is permuted to 
a value less than the value to which the subscript m-1 is permuted. 

9. The method of claim 5 or claim 7, wherein m i 6 and the combinations of values of coefficients v 1( v 2 . ... v,,,.-, are 
limited to combinations such that binary words of the form v 1 v 2 ...v m . 2 v m . 1 have the following two properties: 

each word v 1 v 2 ...v /n . 2 v m . 1 has Hamming weight of at least 4; and 



10. The method of claim 4, wherein and: 

the combinations of values of coefficients v 1p v 2 , ... v m>1 are limited to combinations such that at least two of 
said coefficients have a value other than 0 and q!2\ and 

said permutations of subscripts of the terms X t to X m satisfy the condition that the subscript 1 is permuted to 
a value less than the value to which the subscript m-1 is permuted. 

1 1 . The method of claim 4, wherein q £ 4 and: 

the combinations of values of coefficients v v v 2 , ... v m _, are limited to combinations such that neither of the 
coefficients and v m .i has a value of 0 or g/2; and 
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the coefficient v 1 is always chosen to be fess than the coefficient v m . v 
12. The method of claim 6. wherein q * 8 and: 

5 the combinations of values of coefficients v 1§ v 2 , ... v m .., are limited to combinations such that at least two of 

said coefficients have a value other than 0 and q/4; and 

said permutations of subscripts of the terms X t to X m satisfy the condition that the subscript 1 is permuted to 
a value less than the value to which the subscript m-1 is permuted. 



10 1 3. The method of claim 6. wherein q £ 8 and: 



the combinations of values of coefficients v lf v 2 . ... v m-1 are limited to combinations such that neither of the 

coefficients and v m .i has a value of 0 or q/4; and 

the coefficient v t is always chosen to be less than the coefficient v m . 1# 

15 

14. The method of claim 1 or claim 2 or ctaim 3. wherein p £ m-4 and: 



20 
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m-p 



y- 1 



and 



25 each word v /f 1 v/ <2 ...v /tm .p. 1 v /tm .p has Hamming weight of at least 4; and 

the coefficients w f ; for 1 £ i<j Z p have values q/2; and 

said permutations of subscripts of the terms X, to X m satisfy the condition that the subscript 1 is permuted to 
a value less than the value to which the subscript m-p is permuted. 
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